Your download's being scanned. We'll let you know if there's an issue.
Twitter sentiment analysis.ipynb
No Headings
The table of contents shows headings in notebooks and supported files.
- File
- Edit
- View
- Run
- Kernel
- Settings
- Help
Kernel status: Idle
[1]:
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import LabelEncoder
from sklearn.ensemble import RandomForestClassifier
from sklearn.naive_bayes import MultinomialNB
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.metrics import accuracy_score, classification_report
import spacy
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.pipeline import Pipeline
import seaborn as sns
import matplotlib.pyplot as plt
[3]:
nlp=spacy.load("en_core_web_sm")
[5]:
df=pd.read_csv("C:/Users/rish2/Downloads/archive/twitter_training.csv")
df
[5]:
| 2401 | Borderlands | Positive | im getting on borderlands and i will murder you all , | |
|---|---|---|---|---|
| 0 | 2401 | Borderlands | Positive | I am coming to the borders and I will kill you... |
| 1 | 2401 | Borderlands | Positive | im getting on borderlands and i will kill you ... |
| 2 | 2401 | Borderlands | Positive | im coming on borderlands and i will murder you... |
| 3 | 2401 | Borderlands | Positive | im getting on borderlands 2 and i will murder ... |
| 4 | 2401 | Borderlands | Positive | im getting into borderlands and i can murder y... |
| ... | ... | ... | ... | ... |
| 74676 | 9200 | Nvidia | Positive | Just realized that the Windows partition of my... |
| 74677 | 9200 | Nvidia | Positive | Just realized that my Mac window partition is ... |
| 74678 | 9200 | Nvidia | Positive | Just realized the windows partition of my Mac ... |
| 74679 | 9200 | Nvidia | Positive | Just realized between the windows partition of... |
| 74680 | 9200 | Nvidia | Positive | Just like the windows partition of my Mac is l... |
74681 rows × 4 columns
[7]:
columns =["ID","Entity","Label","Text"]
df=pd.read_csv("C:/Users/rish2/Downloads/archive/twitter_training.csv",names=columns)
df
[7]:
| ID | Entity | Label | Text | |
|---|---|---|---|---|
| 0 | 2401 | Borderlands | Positive | im getting on borderlands and i will murder yo... |
| 1 | 2401 | Borderlands | Positive | I am coming to the borders and I will kill you... |
| 2 | 2401 | Borderlands | Positive | im getting on borderlands and i will kill you ... |
| 3 | 2401 | Borderlands | Positive | im coming on borderlands and i will murder you... |
| 4 | 2401 | Borderlands | Positive | im getting on borderlands 2 and i will murder ... |
| ... | ... | ... | ... | ... |
| 74677 | 9200 | Nvidia | Positive | Just realized that the Windows partition of my... |
| 74678 | 9200 | Nvidia | Positive | Just realized that my Mac window partition is ... |
| 74679 | 9200 | Nvidia | Positive | Just realized the windows partition of my Mac ... |
| 74680 | 9200 | Nvidia | Positive | Just realized between the windows partition of... |
| 74681 | 9200 | Nvidia | Positive | Just like the windows partition of my Mac is l... |
74682 rows × 4 columns
<class 'pandas.core.frame.DataFrame'> RangeIndex: 74682 entries, 0 to 74681 Data columns (total 4 columns): # Column Non-Null Count Dtype --- ------ -------------- ----- 0 ID 74682 non-null int64 1 Entity 74682 non-null object 2 Label 74682 non-null object 3 Text 73996 non-null object dtypes: int64(1), object(3) memory usage: 2.3+ MB
[15]:
Entity LeagueOfLegends 2377 MaddenNFL 2377 CallOfDuty 2376 Verizon 2365 TomClancysRainbowSix 2364 Facebook 2362 Microsoft 2361 Dota2 2359 WorldOfCraft 2357 ApexLegends 2353 NBA2K 2343 CallOfDutyBlackopsColdWar 2343 johnson&johnson 2324 FIFA 2324 TomClancysGhostRecon 2321 Overwatch 2316 Battlefield 2316 GrandTheftAuto(GTA) 2293 HomeDepot 2292 PlayStation5(PS5) 2291 Hearthstone 2286 CS-GO 2284 Xbox(Xseries) 2283 Borderlands 2280 Amazon 2276 Google 2274 Nvidia 2271 Cyberpunk2077 2262 RedDeadRedemption(RDR) 2249 Fortnite 2249 AssassinsCreed 2234 PlayerUnknownsBattlegrounds(PUBG) 2234 Name: count, dtype: int64
[19]:
Label Negative 22358 Positive 20655 Neutral 18108 Irrelevant 12875 Name: count, dtype: int64
1: im getting on borderlands and i will murder you all , -> Positive 2: I am coming to the borders and I will kill you all, -> Positive 3: im getting on borderlands and i will kill you all, -> Positive 4: im coming on borderlands and i will murder you all, -> Positive 5: im getting on borderlands 2 and i will murder you me all, -> Positive 6: im getting into borderlands and i can murder you all, -> Positive 7: So I spent a few hours making something for fun. . . If you don't know I am a HUGE @Borderlands fan and Maya is one of my favorite characters. So I decided to make myself a wallpaper for my PC. . Here is the original image versus the creation I made :) Enjoy! pic.twitter.com/mLsI5wf9Jg -> Positive 8: So I spent a couple of hours doing something for fun... If you don't know that I'm a huge @ Borderlands fan and Maya is one of my favorite characters, I decided to make a wallpaper for my PC.. Here's the original picture compared to the creation I made:) Have fun! pic.twitter.com / mLsI5wf9Jg -> Positive 9: So I spent a few hours doing something for fun... If you don't know I'm a HUGE @ Borderlands fan and Maya is one of my favorite characters. -> Positive 10: So I spent a few hours making something for fun. . . If you don't know I am a HUGE RhandlerR fan and Maya is one of my favorite characters. So I decided to make myself a wallpaper for my PC. . Here is the original image versus the creation I made :) Enjoy! pic.twitter.com/mLsI5wf9Jg -> Positive
[31]:
| ID | Entity | Label | Text | Preprocessed Text | |
|---|---|---|---|---|---|
| 0 | 2401 | Borderlands | Positive | im getting on borderlands and i will murder yo... | m get borderland murder |
| 1 | 2401 | Borderlands | Positive | I am coming to the borders and I will kill you... | come border kill |
| 2 | 2401 | Borderlands | Positive | im getting on borderlands and i will kill you ... | m get borderland kill |
| 3 | 2401 | Borderlands | Positive | im coming on borderlands and i will murder you... | m come borderland murder |
| 4 | 2401 | Borderlands | Positive | im getting on borderlands 2 and i will murder ... | m get borderland 2 murder |
| ... | ... | ... | ... | ... | ... |
| 74677 | 9200 | Nvidia | Positive | Just realized that the Windows partition of my... | realize Windows partition Mac like 6 year Nvid... |
| 74678 | 9200 | Nvidia | Positive | Just realized that my Mac window partition is ... | realize Mac window partition 6 year Nvidia dri... |
| 74679 | 9200 | Nvidia | Positive | Just realized the windows partition of my Mac ... | realize window partition Mac 6 year Nvidia dri... |
| 74680 | 9200 | Nvidia | Positive | Just realized between the windows partition of... | realize window partition Mac like 6 year Nvidi... |
| 74681 | 9200 | Nvidia | Positive | Just like the windows partition of my Mac is l... | like window partition Mac like 6 year driver i... |
73996 rows × 5 columns
[35]:
| ID | Entity | Label | Text | Preprocessed Text | |
|---|---|---|---|---|---|
| 0 | 2401 | Borderlands | 3 | im getting on borderlands and i will murder yo... | m get borderland murder |
| 1 | 2401 | Borderlands | 3 | I am coming to the borders and I will kill you... | come border kill |
| 2 | 2401 | Borderlands | 3 | im getting on borderlands and i will kill you ... | m get borderland kill |
| 3 | 2401 | Borderlands | 3 | im coming on borderlands and i will murder you... | m come borderland murder |
| 4 | 2401 | Borderlands | 3 | im getting on borderlands 2 and i will murder ... | m get borderland 2 murder |
| ... | ... | ... | ... | ... | ... |
| 74677 | 9200 | Nvidia | 3 | Just realized that the Windows partition of my... | realize Windows partition Mac like 6 year Nvid... |
| 74678 | 9200 | Nvidia | 3 | Just realized that my Mac window partition is ... | realize Mac window partition 6 year Nvidia dri... |
| 74679 | 9200 | Nvidia | 3 | Just realized the windows partition of my Mac ... | realize window partition Mac 6 year Nvidia dri... |
| 74680 | 9200 | Nvidia | 3 | Just realized between the windows partition of... | realize window partition Mac like 6 year Nvidi... |
| 74681 | 9200 | Nvidia | 3 | Just like the windows partition of my Mac is l... | like window partition Mac like 6 year driver i... |
73996 rows × 5 columns
Shape of X_train: (59196,) Shape of X_test: (14800,)
[43]:
Pipeline(steps=[('vectorizer_tri_grams', TfidfVectorizer()),
('naive_bayes', MultinomialNB())])In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook. On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.
Pipeline(steps=[('vectorizer_tri_grams', TfidfVectorizer()),
('naive_bayes', MultinomialNB())])TfidfVectorizer()
MultinomialNB()
0.7322972972972973
precision recall f1-score support
0 0.95 0.46 0.62 2575
1 0.65 0.90 0.76 4472
2 0.84 0.64 0.72 3622
3 0.71 0.81 0.76 4131
accuracy 0.73 14800
macro avg 0.79 0.70 0.71 14800
weighted avg 0.77 0.73 0.72 14800
[53]:
Pipeline(steps=[('vectorizer_tri_grams', TfidfVectorizer()),
('naive_bayes', RandomForestClassifier())])In a Jupyter environment, please rerun this cell to show the HTML representation or trust the notebook. On GitHub, the HTML representation is unable to render, please try loading this page with nbviewer.org.
Pipeline(steps=[('vectorizer_tri_grams', TfidfVectorizer()),
('naive_bayes', RandomForestClassifier())])TfidfVectorizer()
RandomForestClassifier()
0.9118918918918919
precision recall f1-score support
0 0.97 0.86 0.91 2575
1 0.92 0.93 0.93 4472
2 0.94 0.89 0.91 3622
3 0.85 0.94 0.90 4131
accuracy 0.91 14800
macro avg 0.92 0.91 0.91 14800
weighted avg 0.91 0.91 0.91 14800
[63]:
| ID | Entity | Label | Text | |
|---|---|---|---|---|
| 0 | 2401 | Borderlands | Positive | im getting on borderlands and i will murder yo... |
| 1 | 2401 | Borderlands | Positive | I am coming to the borders and I will kill you... |
| 2 | 2401 | Borderlands | Positive | im getting on borderlands and i will kill you ... |
| 3 | 2401 | Borderlands | Positive | im coming on borderlands and i will murder you... |
| 4 | 2401 | Borderlands | Positive | im getting on borderlands 2 and i will murder ... |
-
Variables
Callstack
Breakpoints
Source
9
1
Kernel Sources
Common Tools
No metadata.
Advanced Tools
No metadata.
Anaconda Assistant
AI-powered coding, insights and debugging in your notebooks.
To enable the following extensions, create an account or sign in.
- Anaconda Assistant4.1.0
- Coming soon!
- Data Catalogs
- Panel Deployments
- Sharing
Already have an account? Sign In
For more information, read our Anaconda Assistant documentation.
![Python [conda env:base]](./Twitter sentiment analysis_files/logo-64x64.png)